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Description 



METHOD FOR CALCULATION A PITCH 

PERIOD ESTIMATION OF SPEECH 
SIGNALS WITH VARIABLE STEP SIZE 

Background of Invention 
[0001] i. Field of the Invention 

[0002] The present invention relates to a method for calculating a 
pitch estimation, and more specifically, to a method for 
calculation a pitch period estimation of speech signals 
with variable step size. 

[0003] 2. Description of the Prior Art 

[0004] | n t he past few years electronic wireless communication 

has improved. At the same time the popularityof multime- 
dia systems has increased while the demand for sound 
signal encoding and analyzing has become more and 
more popular. Sound telecommunication is an important 
application in the network of the next generation and has 
also an important role in multimedia telecommunications 



in the network. 

[0005] Telecommunication is widely applied to the techniques of 
sound signal encoding. So the telecommunication of 
specification is quite important. At the moment, there are 
some specifications of the International Telecommunica- 
tion Union: PCM(64Kpbs) s G711(64Kpbs) s G726 
(AD PCM s 16 . 24 . 32 . 40Kpbs),G728(Low Delay CELP 
16Kpbs) s G728(Low Delay CELP 8Kpbs). Currently, the cel- 
lular mobile telephone systems in North American use 
VSELP encoding techniques of the TIA(Telecommunication 
Industry Association). The cellular mobile telephone sys- 
tems in Japan and Europe use RPE-LTP encoding tech- 
niques such as JDCQapanese Digital Cellular) and 
GSM(Global System for Mobil Telecommunication). At the 
moment the current encoding technique is still at 8 Kbps. 
But the encoding technique of a new generation of mobile 
telecommunications is at 4.8 Kbps (LD-CELP)-2.4 Kbps 
(MELP,STC). For achieving such a ratio, the operation com- 
plexity is also raised, so that the general digital signal 
processor is used to finish the immediate operation. 

[0006] For matching the design, there are digital signal proces- 
sors in the special application design for sound compres- 
sion or sound identification. The features of the DSP are: a 



short instruction cycle, high parallelism and a plurality of 
special address modes to resolve the general digital signal 
processing. 

[0007] The step with large amounts of operations in voice pro- 
cessing is the step of pitch estimation. This step is calcu- 
lated according to equation 1. 



R[t]= 5>[*M« + r ] 

„ _ n equation 1 



pitch period = {t | mas[ ^[r]]} 



[0008] 



Equation 1 is the operation of the autocorrelation. X[n]is a 
sound signal comprising a plurality of voice data from x[0] 
to x[N-l]. Voice data x[n+ 



] is a sound signal generated according to sound signal 
x[n] which lags a lag parameter. The sound signal x[n + 



] is from x[ 



] to x[N-l + 



]. R[ 



is a autocorrelation value corresponding to a lag parame- 
ter. R[ 



] is the value that the amount of the voice data in the 
sound signal x[n] times the corresponding voice data in 
the sound signal x[n+ 



]. 

[0009] The autocorrelation operation in the method for estimat- 
ing the pitch estimation, according to the prior art, calcu- 
lates a plurality of autocorrelation value according to each 
lag parameter. Then a plurality of autocorrelation values 
are compared and the maximum autocorrelation value of 
these autocorrelation values are found. The lag parameter 
corresponding to the maximum autocorrelation value is 
used for calculating the pitch estimation. 

[0010] Additionally, the normalizing autocorrelation method can 
also be used for estimating the pitch estimation. Please 
refer to equation 2. 
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period = {r | max[ As [«]} 



The normalizing autocorrelation method calculates the 
value R[ 



] according to equation 2, i.e. the value R[ 



] is calculated according to each lag parameter 



in a plurality of lag parameters 



. The values 



] are stored in a memory and compared, until the maxi- 
mum R[ 



] is found. Then a lag parameter 



corresponding to the maximum R[ 



] is used for estimating pitch estimation. 
[0012] The amount of the operation of these two kinds of meth- 
ods for estimating pitch estimation in digital signal pro- 
cessor is quite large. When the data bulk of the entry 
sound data is larger, the time of data processing is longer. 
When the sound signal cannot be operated immediately, 

the quality of the sound signal will be lowered. 
Summary of Invention 

[0013] it is therefore a primary objective of the claimed invention 
to provide a method for calculating a pitch period estima- 
tion of speech signals with a variable step size. 



[0014] The claimed invention provides a method for calculating 
pitch estimation of a sound signal with a voice processor, 
the sound signal comprising a plurality of sound data, the 
method comprising the following steps:(a) providing an 
initial value to a lag parameter;(b) using the voice proces- 
sor to calculate an autocorrelation value according to the 
lag parameter; (c) storing the lag parameter and the cor- 
responding autocorrelation value in a memory; (d)setting 
a first increment and a second increment; (e)using the 
voice processor to compare the autocorrelation values in 
step (b) with a first threshold value, wherein when the au- 
tocorrelation value is less than the first threshold value, 
the lag parameter is increased by the first increment, and 
when the autocorrelation value is larger than the first 
threshold value, the lag parameter is increased by the 
second increment; (f)repeating the step (b), step (c), step 
(d) and step (e) until the lag parameter is larger than a 
predetermined value; and (g)comparing the plurality of 
autocorrelation values stored in the memory to find a 
maximum autocorrelation value and calculating a pitch 
estimation of the sound signal according to the lag pa- 
rameter corresponding to the maximum autocorrelation 
value. 



Brief Description of Drawings 

[0015] pig.l is a block diagram of a voice processor according to 
the invention. 

[0016] pig. 2 is a flowchart of a method for estimating a pitch es- 
timation according to the invention. 

[0017] pig. 3 is a flowchart of a method for estimating a pitch es- 
timation in the first embodiment in the invention. 
Detailed Description 

[0018] please refer to Fig.l. Fig.l is a block diagram of a voice 
processor 12 according to the present invention. A sound 
signal is an input in a voice processing device 10. The 
voice processing device 10 comprises a voice processor 
12 for processing the sound signal x[n], a memory 14 for 
storing a plurality of lag parameters and autocorrelation 
values R[ 



] calculated by the voice processing device 10 and a 
database for storing the sound signal x[n] and corre- 
sponding pitch range. The sound signal x[n] is generated 
by a sound signal generator 16 and input in the voice 
processing device 10. 

[0019] please refer to Fig. 2. Fig. 2 is a flowchart of a method for 
estimating a pitch estimation according to equation 1 in 
the invention. The method comprises the following steps: 

[0020] step 200: Providing an initial value to a lag parameter with 
the voice processor 12; 

[0021] step 202: using the voice processor 12 to calculate an au- 
tocorrelation value according to the lag parameter 



; the autocorrelation operation can be operated according 
to the above-mentioned equation 1 or equation 2; 
[0022] step 204: Storing the lag parameter 



and the corresponding autocorrelation value R[ 



] in a memory 14; 
[0023] step 206: Setting a first incrementA i and a second 
incrementA ; 
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[0024] step 208: using the voice processor 12 to compare the 
autocorrelation values R[ 



] in step (b) with a first threshold value R , wherein when 

thl 

the autocorrelation value R[ 



] is less than the first threshold value R , the lag param- 
eter 



is increased by the first incrementA^ and when the auto- 
correlation value is larger than the first threshold value R 
, the lag parameter 



is increased by the second incrementA 2 ; 
[0025] step 210: repeating step (b), step(c), step (d) and step (e) 
until the lag parameter 



is larger than a predetermined value; and 
[0026] step 2 12: comparing the plurality of autocorrelation val- 
ues R 



] stored in the memory 14 to find a maximum autocorre- 
lation value R[ 



] and calculating a pitch estimation of the sound signal 
according to the lag parameter 



corresponding to the maximum autocorrelation valu 



]■ 

[° 027 ] In step 200 to step 204, the voice processor 12 is used for 
providing an initial value to a lag parameter 



and calculating an autocorrelation value according to the 
lag parameter 



. The lag parameter 



and the corresponding autocorrelation values R[ 



] are stored in a memory 14. The initial value can be set as 
1 or other value. In step 206 and step 208, a first incre- 
ment A and a second increment A are set at first. The 

1 2 

voice processor 12 compares the autocorrelation values R[ 



] in step (b) with a first threshold value R . When the 

thl 

tocorrelation value R[ 



] is less than the first threshold value R , the lag param- 
eter 



is increased by the first incrementA . When the autocorre- 
7 1 

lation value R[ 



] is larger than the first threshold value R , the lag pa- 
rameter 



is increased by the second incrementA The incrementA 
is less than the incrementA 

1. 

[0028] when the autocorrelation value R[ 



] is larger than the first threshold value R , the lag pa- 
rameter 



is increased by the second incrementA^ The purpose is to 
avoid ignoring the lag parameter 



corresponding to the pitch estimation. When the autocor- 
relation value is larger than a first threshold value R , 

3 thl' 

the lag parameter corresponding to the autocorrelation 
value is close to the lag parameter corresponding to the 
pitch estimation of the sound signal and the second in- 
crement A is increased by the lag parameter 



. The second incrementA can be set as 1 or other value 
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that is less than the first incrementA . When the autocor- 

1 

relation value R[ 



] is less than the first threshold value R , the lag param- 
eter 



is increased by the first incrementA^ The purpose is to 
ignore some lag parameters 



to reduce the amount of the autocorrelation operations . 
When the autocorrelation value is less than a first thresh- 
old value R , the lag parameter corresponding to the au- 
tocorrelation value is not close to a lag parameter corre- 
sponding to the pitch estimation of the sound signal and 
the second increment A i is increased by a lag parameter 



. The second incrementA 2 can be set as a larger value to 
ignore some lag parameters 



to reduce the amount of the autocorrelation operations. 
The first increment can be adjusted according to a differ- 
ent system. 

[0029] | n ste p 210, steps 202-208 are repeated. A plurality of 
autocorrelation values are calculated and stored in the 
memory 14 with a plurality of lag parameters. Because the 
autocorrelation is used for finding the level that the sound 
signal is similar to itself. When the sound signal is a cycle 
sound signal, the steps 202-208 are repeated until the lag 
parameter 



is larger than the cycle number of the sound signal x[n]. 
When the sound signal is not a cycle sound signal, steps 
202-208 are repeated until the lag parameter 



is larger than the number of the sound signal x[n]. 
[0030] The autocorrelation operation for the non-cycle sound 
signal (ex: the noise or the sign) the autocorrelation val- 
ues R[ 



] or the square of the autocorrelation values 



] cannot be used as the reference data for pitch estima- 
tion. Because the autocorrelation operation is used for 
finding the similar level between the sound signal and it- 
self, a plurality of autocorrelation values of the cycle 
sound signal are showed in a regular pattern for finding 
the pitch estimation so that the pitch estimation can be 
found among the plurality of autocorrelation values. The 
autocorrelation values of the non-cycle sound signal are 
not showed in a regular pattern for finding the pitch esti- 
mation so that the pitch estimation of the sound signal 
cannot be found among the plurality of the autocorrela- 
tion values. In the embodiment, the autocorrelation oper- 



ation is only operated in the cycle sound signal to find the 
pitch estimation. 
[0031] | n s tep 212, the voice processor 12 is used for comparing 
the plurality of autocorrelation values R[ 



] stored in the memory 14 to find a maximum autocorre- 
lation value R[ 



] and calculating a pitch estimation of the sound signal 
according to the lag parameter 



corresponding to the maximum autocorrelation valu 



]. 

[0032] The amount of the autocorrelation operations in the in- 
vention is less than the amount of the autocorrelation op- 
erations according to the prior art. The autocorrelation 
values are calculated according to each lag parameter 



of a plurality of lag parameters 



. The lag parameter 



is increased by the first increment A i or the second incre- 
ment A 2 in the invention. When the lag parameter 



is increased by the first increment A i or the second incre- 
ment A 2 , the lag parameter between the lag parameter 



and the lag parameter 



+A or the lag parameter 



+A 2 are omitted. The autocorrelation values correspond- 
ing to the omitted lag parameters can be set as zero or as 
a smaller number. 
[0033] | n the invention, a third increment or a plurality of incre- 
ments can be set. The autocorrelation values in the step 
202 are compared with a second threshold value R 

K th2 

[0034] The second threshold value R is larger than the first 

th2 3 

threshold value R . When the autocorrelation value R[ 

thl 



] is less than the second threshold value R and larger 

th2 3 

than the first threshold value R , the lag parameter 

thl 



is increased by the second incrementA . When the auto- 
correlation value R[ 



] is larger than the second threshold value R the lag 

M th2, ^ 

parameter 



is increased by the third incrementA . 

1 3 

[0035] please refer to Fig. 3. Fig. 3 is a flowchart of a method for 
estimating a pitch estimation in the first embodiment of 
the invention. The embodiment is implemented in the 
voice processor 10. 

[0036] step 300: Providing an initial value to a lag parameter with 
the voice processor 12; 

[0037] step 302: using the voice processor 12 to calculate an au- 
tocorrelation value according to the lag parameter 



; the autocorrelation operation can be operated according 
to the above-mentioned equation 1 or equation 2; 
[0038] step 304: Storing the lag parameter 



and the corresponding autocorrelation value R[ 



] in a memory 14; 
[0039] step 306: Setting a first incrementA i and a second 
incrementA ; 
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[0040] step 308: using the voice processor 12 to compare the 
autocorrelation values R[ 



] in step 302 with a first threshold value R , wherein 

K thl 

when the autocorrelation value R[ 



] is less than the first threshold value R , the lag param- 
eter 



is increased by the first increment A , and when the auto- 
correlation value is larger than the first threshold value R 
, the lag parameter 



is increased by the second increment A^, 
[0041] step 310: when the lag parameter 



is larger than a predetermined value, step 312 is imple- 
mented; when the lag parameter 



is less than a predetermined value, step 302 is imple- 
mented; and 

[0042] step 3 12: comparing the plurality of autocorrelation val- 
ues R[ 



] stored in the memory 14 to find a maximum autocorre- 
lation value R[ 



] and calculating a pitch estimation of the sound signal 
according to the lag parameter 



corresponding to the maximum autocorrelation valu 



]. 

[0043] The amount of the autocorrelation operations in the in- 
vention is less than the amount of the autocorrelation op- 
erations according to the prior art. The autocorrelation 
values are calculated according to each lag parameter 



of a plurality of lag parameters 



. The lag parameter 



is increased by the first increment A i or the second incre- 
ment A 2 in the invention. When the lag parameter 



is increased by the first increment A i or the second incre- 
ment A 2 , the lag parameter between the lag parameter 



and the lag parameter 



+A or the lag parameter 



+A 2 are omitted so that the amount of operations can be 
reduced. And the lag parameter increases less for the sec- 
ond increment to avoid omitting the interval that the 
pitch estimation is probably in. 
[0044] Those skilled in the art will readily observe that numerous 
modifications and alterations of the device may be made 
while retaining the teachings of the invention. Accord- 
ingly, the above disclosure should be constructed as lim- 
ited only by the metes and bounds of the appended 
claims. 



